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Abstract 

We introduce and analyze a random tree model associated to Hoppe's urn. The tree is 
built successively by adding nodes to the existing tree when starting with the single root node. 
In each step a node is added to the tree as a child of an existing node where these parent 
nodes are chosen randomly with probabilities proportional to their weights. The root node 
has weight 1? > 0, a given fixed parameter, all other nodes have weight 1. This resembles the 
stochastic dynamic of Hoppe's urn. For # = 1 the resulting tree is the well-studied random 
recursive tree. We analyze the height, internal path length and number of leaves of the Hoppe 
tree with n nodes as well as the depth of the last inserted node asymptotically as n — > oo. 
Mainly expectations, variances and asymptotic distributions of these parameters are derived. 
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1 Introduction 

We consider a random tree model associated and derived from Hoppe's urn: In Hoppe's urn, see 
[5], there initially is one red ball. In each step one of the balls is drawn from the urn independently 
with probabilities proportional to the weights of the balls. The red ball has weight ■d > 0, all other 
balls have weight 1. Here the parameter i3 > is given and fixed throughout the evolution of the 
urn. When a ball is drawn it is placed back to the urn together with a ball of the same color 
unless the ball drawn is the red ball. In this case the red ball is placed back together with a ball 
of a new color not yet being present in the urn. This model has been introduced for deriving and 
interpreting the Ewens sampling formula and is related to the infinite alleles model in population 
genetics, the parameter & > modeling the mutation rate. The decomposition of the balls into 
groups of the same color (neglecting the red ball) leads to a Chinese restaurant process, the (0, ■d) 
seating plan, see Pitman Q31 page 61]. 

A random tree model, which we subsequently call Hoppe tree, is associated to the Hoppe urn 
as follows: The balls in the urn are represented by nodes in the tree. Each node v is child of 
node w in the tree if the ball corresponding to v was placed first in the urn together with the ball 
corresponding to w when the w-ball was drawn. In other words the tree grows successively: In 
each step a node is chosen independently and with probability proportional to the weights of the 
nodes (the root having weight §, all other nodes having weight 1) and a new node is added as 
child of the chosen node. For d — 1 this is a well-known and well-studied random tree model, the 
random recursive tree, see, e.g., Smythe and Mahmoud [IS] . 

The aim of the present note, which is based on the first author's master's thesis [10], is to 
study asymptotic properties of the Hoppe tree as its size n tends to infinity. In particular we are 
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interested in the deviation from the random recursive tree model caused by the perturbation of 
the root weight from # = 1 to i? ^ 1. As characteristics of the tree we study the depth of 

the n-th inserted node in the tree, defined as its distance to the root of the tree. Furthermore the 

($) .... . ($) 

tree's height W n is studied, which is the maximal depth maxi<i<n D\ , its internal path length 

i<i< n and the number of leaves of the tree. A node is a leaf if it has no child in 
the tree. Our results show, that the perturbation of the root weight does typically not affect the 
first order behavior of the quantities, an exception being the variance and limit law of the internal 
path length. Hence, we give second order expansions to reveal the asymptotic dependence on d. 

The paper is organized as follows: In the second section the results on the four quantities 
mentioned above are stated, the proofs being collected in the third section. 
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2 Results 

In this section the results on depth, height, internal path length and number of leaves are stated. 
Throughout the parameter i? > is arbitrary and fixed. All asymptotic statements as well as the 
use of the Bachmann-Landau symbols are understood as n, the number of nodes in the Hoppe 
tree, tends to infinity. Moreover, we use the digamma and trigamma functions = ^logT and 

= ip"l°gr respectively. We use, see, e.g., [TJ 6.3. and 6.4.], 

n— 2 oo , i \ 2 

i=l fe=l v ' 

Depth of a node 

For the depth D n we have a distributional representation as sum of independent Bernoulli vari- 
ables: 

Theorem 2.1. For the depth D„ of the n-th node in a Hoppe tree we have for all n > 2 

n-2 
i=l 

where B±,... B n -2 are independent and P(Bi = 1) = 1 — P(Bi = 0) = for i — 1, . . . , n. 

Asymptotic results can hence easily be obtained, e.g., the following. We denote by n(A) the 
Poisson distribution with parameter A > 0, by d-rv the total variation distance between probability 

measures, by — ^> convergence in distribution and by Af(0, 1) a real random variable with the 
standard normal distribution. 
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Corollary 2.2. The depth of the n-th node in a Hoppe tree satisfies 

n-2 

E[Z>W] = i + V —.= logn - + 1) + 1 + o(l), 

z — ' 17 + 1 

2 — 1 

n— 2 1 n— 2 / , \ 2 
Var(^)) = V-^- V(-^ 

2—1 i— 1 v 



/Var(£)W) 
d TV (£( J DW),n(E[i?W]))=0( i ^ 



logn- + 1) - + 1) +o(l), 

AA(0,1), (1) 



Height of the Hoppe tree 

The height H n of the Hoppe tree can be analyzed by drawing back to results on the height for 
random recursive trees, see Addario-Berry and Ford [2J, in particular they show that 

M n := E[ff «] = e log n - ~ log log n + 0(1) (2) 
as n — > oo. We transfer their results to arbitrary i9 > 0: 

Theorem 2.3. For £/ie height H„ of a Hoppe tree with n nodes we have: For all a < p < ^ 
there exist constants C a , Cp > such that for all t > 

P (iff ) - M n >t)< Cpe-P\ P (ffW _ Mn < -f) < C Q e- a *. 

T/ie constant Cp can be chosen independently of 

Corollary 2.4. T/ie height of a Hoppe tree with n nodes satisfies 

E[HW] = elogn - 5 bglogn + 0(1), Var(ijW) = 0(1). 
Number of leaves 

The number of leaves in a Hoppe tree is related to a two-color urn model. We obtain: 
Theorem 2.5. Let L n be the number of leaves in a Hoppe tree with n > 2 nodes. Then 

F(\L n - E[L n ] \>t)< 2exp — - for all t > 0, n > 1, (3) 



71 + + 1 



Atv(o,i). 
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Internal path length 

Moments of the internal path length can be obtained from our results on the depths of nodes. 
Theorem 2.6. The internal path length I„ of a Hoppe tree with n nodes satisfies 

n—l 

Wn ] ] = + n - 1) 1 ■ = nl °E n - + 1)" + o(n), 

1=1 



Moreover, 

f4* } -E[/W] 



i? + n - 1 



is a zero-mean martingale. 

The internal path length can be analyzed either via martingale methods or the recursive dis- 
tributional decomposition explained in Figure 1 which allows to apply the contraction method. 

Theorem 2.7. The internal path length 7„ of a Hoppe tree with n nodes satisfies 

n 

for a non-degenerate random variable X^\ where the convergence holds almost surely and in L^. 
The distribution C(X^) is the only integrable solution of the distributional fixed point equation 

X W £ (1 _ B)X W + BX {1) + Blog(B) + (1 - B) log(l - B) + 1, 

where X^\ X^ and B are independent, B has the betafl,^) distribution and X^' is distributed 



Theorem 2.8. The limit distribution C{X^) in Theorem 2.1 has a Lebesgue density f#, which is 
in the Schwartz space on R, i.e., is infinitely differentiable and together with all its derivatives 
rapidly decreasing. 



3 Proofs 

In the analysis of the tree below the random decomposition of the Hoppe tree shown in Figure 
1 is used: The tree is decomposed into the subtree of the second inserted node (left dashed box) 
and the remaining part of the tree (right dashed box) . The stochastic dynamic of the Hoppe tree 
with parameter $ implies that conditioned on the size N n of the subtree of the second inserted 
node this subtree is a random recursive tree, whereas the remaining part is a Hoppe tree with 
parameter & and size n — N n . Moreover, conditional on N n these two trees are independent. We 
have the asymptotic behavior 

— - — > B almost surely (n — >■ oo) (4) 

where B has the beta(l,i?) distribution having Lebesgue density x H> #(1 — a;)" 5-1 , x £ [0, 1], see 
Donnelly and Tavare [6]. 
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Figure 1: A Hoppe tree with 11 nodes. The decomposition into the subtree rooted at node labelled 
2 and the remaining part of the tree is indicated in dashed boxes. 



Proof of Theorem \2.1\ We calculate the depth of a node by counting its ancestors in the tree. 
We have = Y^i=i ~^A t „, where A+ j denotes the event, that node i is an ancestor of node j, 
i < j. Cleary P(yl l n ) = 1. Moreover, V(A i i+1 ) = for i > 2 by definition of the Hoppe tree. 

For general i < n let £j in be the number of descendants of node i in a Hoppe tree with n nodes, 
i.e. the size of subtree rooted in i minus 1. By the dynamics of the Hoppe tree we have 

HAiA^n-x) = Y+T-r (5) 
We calculate E[& , n -i] by the recursion 

1 + Efe,„_ 2 ] 



E[^,„_i] = E[^,„- 2 + 1^,„_J = Efe,„- 2 ] 
This yields E[£f jn _i] = ^jrfEf — 1 an d therefore, by equation (|5 



tf + n- 3 



nA i , n ) = JT -- 1 . (6) 

It remains to show that A2, n , • ■ ■ , -<4n-i,n are independent. Note that for i < j, Aij only depends 
on where the nodes i + l,...,j are inserted. Therefore, we get for all 2 < k < n — 2 and 
2 < i\ < ■ ■ ■ < ik < n — 1 independence of Ai u i 2 , Ai 2i i 3 , . . . , Ai k<n . This yields 

p ^fl i- ,j =¥(A iui2 nA i2>i3 n...nA ik , n ) 

= Y(A iui2 )-F(A i2M )-...-Y(A ik , n ) 

k 

where (|6j) is used in the last equation. With B t = 1^. n this yields the assertion. 

For related reasoning in the analysis of the depth in other random tree models see Dobrow and 
Smythe [5]. □ 



Proof of Corollary \2.2\ Theorem 2.1 implies expectation and variance of D^f\ Moreover, by 
Lindeberg's version of the central limit theorem (CLT) we obtain the CLT for in |lj) and by 
Equation (1.23)] we get d T v(^pS ?) ) ) H(E[D^ } ])) = 0(1/ log n). □ 

Proof of Theorem \2.3\ Addario-Berry and Ford showed in [2J Corollary 1.3] that the expected 
height Hn^ of a random recursive tree satisfies and that for all d < ^ there exists a constant 
C = C(c') such that for all n > 1 and t > 

F(\HW-E[HW}\>t)<Ce- c,t . 

Recall that in a Hoppe tree with n > 1 nodes and parameter § > by N n the size of the subtree 
rooted in node 2 is denoted and that this subtree, conditioned on its size, is a random recursive 
tree. 

By an obvious coupling argument between Hoppe trees for different parameters $ we have 
Hn =4 H^ 2 ^ for all i?i > $2, where ^ denotes stochastic domination. In the extremal case $ = 
(for definition of the tree start with the root and one child) we obtain Hn =4 = 1 + ff^-i ^ 

Tir/i n'nf 



1 + H n ' . Therefore, we get P [H n ' - M n >tj < Ce~ c % C = Ce c , using the result for random 
recursive trees. 

In order to proof the left tail inequality let Hjp be the height of the subtree rooted in node 

2. From H„ > we obtain for all i > and a > (later we have to restrict to a as in the 
Theorem) 

P(ffW - M n < -t) < P({flj£ - M„ < -t} n {7V„ > e- at n}) 

+P({ifg - M n < -t} n {7V„ < e- Qt n}), 



< P(fff;ia* n1 - M n < -t) + V(N n < e~ at n). 

^ n] 
I e at n I J 



Again, by using the result for random recursive trees and M n — E[iJr^l ot -, ] = eat + O(l) we 



obtain for a — d j (1 + ed) a constant C\ such that 

F(H™ atn] - M n < -t) < Cie-^ 1 -™'* = C ie - Q *. 

Hence we have such an upper bound for all a < l/(3e). To get an upper bound for P(N n < e~ at n) 
note that for all 1 < k < n — 1 

nMn - k) -{k-l) (,? + l)...( 1 ? + n-2) ' 

This yields for all e G (0, 1) that 

P(JV„ < en) < 3(i? + l)e. 

Therefore, 

P(i?W - M n < -i) < (Ci + 3(i? + l))e" Q *. 
This implies the assertion. □ 

Proof of Corollary \2-4\ By Theorem |2.3| we have 

E[|ffW - M B |] = Oil). 

Consequently, E[H n ^] = M n + O(l) = elogn - § loglogn + 0(1). 
Moreover, the tail bound from Theorem |2.3| implies 



□ 



Var(//W)<E[( J ffW-M„) 2 ]=0(l). 



G 



1. if the parent of node n was no leaf at time n — 1, 
0, otherwise. 



Proof of Theorem \2.5\ We have L„ = + Y n , where 
Therefore, for n > 2, almost surely 

e[l^ 1 \l^,...,lW] = L w + i 



L 



i9 + n-2 



With 



i? + n - 1 «? + n - 1 " 
i?(n-l) 



+ 1. 



(7) 



the sequence (^n) n >2 is a zero-mean martingale and 

n — 1 $(n — 1) i? + n — 1 



With the representation 



2(tf + ra-2) 



X, - = (tf + * - 2)0^ - E[yj) + if\ - E[L^], i > 3 

we have Z t < X; - < Z,- + + i - 2 where Z< = Lf\ - E[Lf\] ~ + i - 2)E[y]. By a 

version of the Azuma-Hocffding inequality with conditional ranges, see, e.g., [121 Theorem 3.14], 
we have for all t > 

/ 2t 2 
n\X n \>t)<2ex P (- ^ {i + ^_ 2y 

Using that the sum in the denominator of the latter exponent is bounded by (n + $ — 2) 3 /3 + (n + 
■d — 2) 2 and the scaling in (JtJ) this implies the bound pi). 

In order to compute VarfX^) we have X n = |±^5§X„_i + (<& + n - 2)(Y n - E[Y n ]). Hence, 



O 



■ {■») 



E[X 2 n ] = 



fl + n- 2 
§ + n- 3 



+ 2 



t? + n-3 



E[x n _!(y„-E[y„])] 



+ (i9 + n-2) 2 Var(F„). 
Using E[X„_i] = we have 



(8) 



E[x„_ 1 (y n -E[y„])]=E[x„_ 1 E[y n |4 , ' ) ,---,ir-i]]=E 

l 



1 



■tf+n-2 



(tf + n-2)(tf + n-3) 



Moreover, E[F n ] = 1 - = \ + O (l/n 2 ) and Var(r„) = \ + O (l/r 



Solving (|8|) by the substitution Q„ = |±^E§E[J^] yields 

i? + n - 1 



Var(iW) 
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To obtain the CLT for L n the representation 



E[L 



X„ 



7 



allows to apply a general martingale CLT, see, e.g., Hall and Heyde [H Theorem 3.2]. It is sufficient 
to show that ^ 

:= — - (Xi-Xi-i), n>3,3<i<n, 

v /Var(X„) 

satisfies 



(a) max |A nii | 

3<i<n 



max A 

3<z<n 



< OO. 



o, (b) J2 A L- J ^ 1 > ( c ) ™ff E 

3<i<n ~~ 

For (a) and (c) we have |X* - = |if ) - E[L<f } ] + (i? + i - 3)(Yj - E[Yj])| < ?9 + 2n + 3 for 

i < n and Var(X„) = (0 + n - l) 2 Var( J L^ ) ) - f^. Hence, |A n>i | < (2n + + 3)/ v /Var(X„) a.s., 
which yields that max, |A n i | — > and that E [maxj A 2 J is bounded in n. 

To compute A 2 ^ note that by kty and the Borel-Cantelli Lemma we have (iffl — E[L^] ) /n — > 
almost surely. Hence, for all n > 37 

n n ~ n 



i=3 



i=3 



1 " 



(9) 



By (L^ — E[i„ ])/n— >■(), Var(X„) ~ and the Cesaro mean we have for the first summand in 



i=3 \ 



and for the second summand in (|9| 



Var(X„ 



J2(L { r E[l\ '])(& + i - 3)(15 - E[F,]) 



< 



2n 2 (^ + r7, + 3) 1 
Var(X„) n 



E 



LW-JL[L?>] 



Because E[^] = \ + O (£) we have (Y* - E[Yj]) 2 = | + (£) a.s. and therefore for the last 
summand in a.s. 

n 



This implies ^A^, -A 1. 



□ 



Proof of Theorem \2.6\ For j > 1 let .7-j = a{D ( f\ . . . , D^). By the dynamics of the Hoppe 
tree we have almost surely 



E[^|7 n -i] 



(9 



-<M" + 1) + £ ^<A'" + D-1 + ^e», do) 



i? + n - 2 
Wit .1-t- 1 "' 



i=2 



d + n-2 



Consequently, K[J^>\F n -i] = 1^ + E[D { n ) \J 7 n _ 1 ] = + 1 almost surely. Therefore, 



#+n-l rW 



+ n 



1 1 

n - 1 " ^ i? + i 

i— 1 
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is a zero-mean martingale and E[/,V 



(0- 



i 



The calculations to obtain the expansion for the variance of I n can be done similarly to the 

□ 



calculations in the proof of Theorem 2.5 for details we refer to the master's thesis [10] 



Proof of Theorem \2. 7| To apply a martingale convergence theorem it is sufficient to have a 
bound on the variance of the martingale uniformly in n. Hence, our expansion of Var(4* 3 ) m 



Theorem 2.6 is sufficient to imply almost sure and L2 converge nce of the martingale there, which 



also applies to the slightly different scaling of /„ in Theorem 2.7 By our decomposition of the 
Hoppe tree, see Figure 1, we obtain the recurrence 

m A rW , 7(i) , at 

1 n ~ 1 n-N n ' l N„ ' iv ™' 



where (Ij)j>i, (^^)j>i an d N n are independent and (ij )j>i is distributed as (jj 1 )j>i- For 
the scaling, 



X W ■- 



- n log n 



(11) 



we obtain 



n-N n 



-4-iv„ + — XnI + - [ Nn log I — ] + (n - N n ) log 



n - N, n 



N n ), (12) 



with independence and distributional conditions as in (ml This suggests that the limit jM of 
(X n ^) n >i should satisfy the recursive distributional equation 



X W) B )j{0) + BX (i) + B log ( B ) + (1 - B) log(l - B) + B, 



(13) 



where and B are independent, and B has the beta(l,$) distribution. Note that X^ 1 ' 

is the limit distribution of the internal path length of the random recursive tree, that has been 
obtained by martingale methods by Mahmoud [H] and by the contraction method by Dobrow 
and Fill [4|. In particular, in [4] it is shown that (Xn ^)n>i convergence to its limit X^ in the 
minimal ^2 metric, i.e., weakly and with second moments. This allows us to write the recurrence 



( 12 I in the form 



xW£a^x^2 n +&w 



with coefficients 



h (n) 



N„ 



X 



(i) 



- iV„log 
n 



Nn 
n 



+ {n-N n ) log 



N r . 



Hence we have convergence of the coefficients to the corresponding quantities in the recursive 
distributional equation (13 1 in £1, £2, in fact in any £ p , p > 1. This allows to apply general 



convergence theorems in the framework of the contraction method, see Rosier [151 Theorem 3] 
and Neininger and Riischendorf |13[ Theorem 4.1]. In particular, one can first apply Theorem 4.1 
in [13] with the choice of s = 1 there: This implies convergence in distribution of X n ^ to X^\ 
where X^ is the unique integrable solution of (13), and convergence of the expectations. With 



this knowledge on the expectation, which, of course, is also covered by our explicit formula for 
E[J„ ], one can apply either Theorem 4.1 in [13 with the choice of s = 2 or Theorem 3 in [T5] to 
also obtain convergence of the second moments. 

Alternatively to applying the contraction method we could as well use the almost sure conver- 
gence of X n ^ from the martingale argumen t to gether with the almost sure convergence of N n /n 



in Ml to argue that the limit X^ satisfies (13). 



□ 
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Proof of Theorem \2.8\ For the characteristic function tp&(t) := E[exp(itX^'] of the re- 

cursive distributional equation in Theorem |2 . 7| implies 

\Mt)\< [ \<pi(xt)\\<pt(Q. - x)t)\#(l - xf- 1 fa, tel. 

Jo 

We can apply the techniques of Fill and Janson [7] to show that this relation together with an 
initial bound on \ip#\ allows to show that \ is rapidly decreasing. The details are carried out in 
the master's thesis [10] - Since Fourier transform is an automorphism on the Schwartz space, this 
implies the assertion. □ 
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